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Abstract 

We present two restricted versions of one-tape Turing machines. Both character¬ 
ize the class of context-free languages. In the first version, proposed by Hibbard in 
1967 and called limited automata, each tape cell can be rewritten only in the first d 
visits, for a fixed constant d > 2. Furthermore, for d = 2 deterministic limited au¬ 
tomata are equivalent to deterministic pushdown automata, namely they characterize 
deterministic context-free languages. Further restricting the possible operations, we 
consider strongly limited automata. These models still characterize context-free lan¬ 
guages. However, the deterministic version is less powerful than the deterministic 
version of limited automata. In fact, there exist deterministic context-free languages 
that are not accepted by any deterministic strongly limited automaton. 


1 Introduction 

Despite the continued progress in computer technology, one of the main problems in de¬ 
signing and implementing computer algorithms remains that of finding a good compromise 
between the production of efficient algorithms and programs and the available resources. 
For instance, up to the 1980s, computer memories were very small. Many times, in order 
to cope with restricted space availability, computer programmers were forced to choose 
data structures that are not efficient from the point of view of the time required by their 
manipulation. Notwithstanding the huge increasing of the memory capacities we achieved 
in the last 20-30 years, in some situations space remains a critical resource such as, for 
instance, when huge graphs need to be manipulated, or programs have to run on some 
embedded systems or on portable devices. 

This is just an example to emphasize the relevance of investigations on computational 
models operating under restrictions, which was also one of the first research lines in com¬ 
putability and complexity theory. In particular, starting from the 1960s, a lot of work has 
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been done to investigate the minimal amount of resources needed by a machine in order 
to be more powerful than finite state devices. A natural formalization of these problems 
can be obtained in the realm of formal languages, by studying resource requirements for 
nonregular language recognition. Classical investigations in this field are related to time 
and space resources and produced some “gap results”. 

Let us start by briefly considering time. First of all, using a simple argument, one 
can prove that each language which is accepted by a Turing machine in sublinear time 
is regular. 1 Moreover, it should be clear that each regular language can be accepted in 
linear time by a Turing machine which never writes symbols, namely by a (two-way) finite 
automaton. However, by considering machines with a separate work tape, it is possible to 
recognize in linear time even nonregular languages, such as for instance {a n b n \ n > 0} or 
the Dyck language of balanced parentheses. 

When we consider one-tape Turing machines , namely machines with a single (infinite or 
semi-infinite) tape, which initially contains the input and which can be rewritten during the 
computation to keep information, 2 the question of the minimum amount of space needed 
to recognize nonregular languages becomes more interesting. In 1965 Hennie proved that 
deterministic one-tape Turing machines working in linear space are not more powerful 
than finite automata, namely, they can recognize only regular languages [ ]. Hence, the 
ability to store data on the tape is useless if the time is restricted to be linear. 3 Actually, 
this remains true even for some time bounds growing more than linear functions. In 
fact, as independently proved by Trakhtenbrot and Hartmanis, machines of this kind can 
recognize nonregular languages only if their running times grow at least as n log n. Hence, 
there is a gap between the time sufficient to recognize regular languages, which is linear, 
and the time necessary for nonregular language recognition. Furthermore, examples of 
nonregular languages accepted in time 0(nlogn ) have been provided, proving that the 
bound is optimal [28, 5]. 

Concerning the nondeterministic case, in 1986 Wagner and Wechsung provided a coun¬ 
terexample showing that the nlogn time lower bound cannot hold [29, Thm. 7.10]. This 
result was improved in 1991 by Michel, by showing the existence of NP-complete languages 
accepted in linear time [16]. Since there are simple examples of regular languages requiring 
linear time, we can conclude that, in the nondeterministic case, we do not have a time gap 
between regular and nonregular languages. 

However, this depends on the time measure we are considering. In fact, we can take 
into account all computations (strong measure), or we can consider, among all accepting 
computations on each input belonging to the language, the shortest one (weak measure). 
This latter measure is related to an optimistic view of nondeterminism: On a given input, 
when a nondeterministic machine guesses an accepting computation, it is also able to guess 
the shortest one. 

(Tee, e.g., [20]. 

2 As usual, the cells outside the input portion contain a special blank symbol. 

'Whose machines were called one-tape off-line Turing machines, where “off-line” means that all input 
symbols are available, on the tape, at the beginning of the computation and they can be read several 
times, if not rewritten. In contrast, in the 1960s the term “on-line” was used to indicate machines with 
a separate read-only one-way input tape: in this case the machine can read each input symbol only one 
time. So, if an on-line machine has to use the same input symbol more times, it needs to use space to save 
it somewhere. 
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weak 

Deterministic machines 

n log n 

nlogn 

Nondeterministic machines 

n log n 

n 


Table 1: Time lower bounds for the recognition of nonregular languages by one-tape Turing 
machines. The table should be read as follows: a row r denotes a type of machine while a 
column c a measure. If the element at the position (r, c) of the table is the function f(n), 
then t(n) ^ o(/(n)) for each one-tape off-line Turing machine of type r that recognizes a 
nonregular language in time t(n) under the measure corresponding to column c. All the 
bounds have been proved to be optimal. For a survey see [20]. 


The abovementioned results by Wagner and Wechsung and by Michel have been proved 
with respect to the weak measure. For the strong measure the situation is completely 
different. In fact, under this measure, the nlogn time lower bound for the recognition of 
nonregular languages holds even in the nondeterministic case, as proved in 2010 by Tadaki, 
Yamakami, and Lin [27]. Table 1 summarizes the above-discussed time lower bounds for 
the recognition of nonregular languages by one-tape Turing machines. 

Now, let us discuss space. It is easy to observe that Turing machines working in 
constant space can be simulated by finite automata, so they accept only regular lan¬ 
guages. In their pioneering papers, Hartmanis, Stearns, and Lewis investigated the mini¬ 
mal amount of space that a deterministic Turing machine needs to recognize a nonregular 
language [13, 21 ]. In order to compare machines with finite automata, counting only the 
extra space used to keep information, the machine model they considered has a work tape, 
which is separate from the read-only input tape. The space is measured only on the work 
tape. 

They proved that if the input tape is one-way, namely the input head is never moved 
to the left, then, in order to recognize a nonregular language, a logarithmic amount of 
space is necessary. Hence, there are no languages with nonconstant and sublogarithmic 
space complexity on one-way Turing machines. 

In the case of two-way machines, namely when machines can move the head on the 
input tape in both directions, the lower bound reduces to a double logarithmic function, 
namely a function growing as log log n. 4 

These results have been generalized to nondeterministic machines by Hopcroft and Ull- 
man under the strong space measure, namely by taking into account all computations [9]. 
The optimal space lower bound for nonregular acceptance on one-way nondeterministic 
machines reduces to log log n, if on each accepted input the computation using least space 
is considered (weak space), as proved by Alberts [ ]. For a survey on these lower bounds 
we point the reader to [ ]. 5 

Let us now consider linear space. It is well known that nondeterministic Turing ma¬ 
chines working within this space bound characterize the class of context-sensitive lan- 

4 Actually, in these papers the authors used the abovementioned terms on-line and off-line instead of 
one-way and two-way. 

5 Many interesting results concerning “low” space complexity have been proved. For surveys, see, e.g., 
the monograph by Szepietowski [26] and the papers by Michel [17] and Geffert [3], 
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guages. This remains true in the case of linear bounded automata, namely one-tape Turing 
machines whose work space is restricted to the portion of the tape which at the beginning 
of the computation contains the input, as proved in 1964 by Kuroda [11]. 

An interesting characterization of the class of context-free languages was obtained by 
Hibbard in 1967, considering a restriction of linear bounded automata, called scan limited 
automata or, simply, limited automata [ ]. A limited automaton is a Turing machine that 
can rewrite the content of each tape cell only during the first d visits, for a fixed constant d. 
It has been observed that the restriction of using only the portion of the tape which initially 
contains the input does not reduce the computational power of these models [22]. Then, 
limited automata can be seen as a restriction of linear bounded automata while, in turn, 
two-way finite automata, which characterize regular languages, can be seen as restrictions 
of limited automata. Hence, we have a hierarchy of classes of one-tape Turing machines 
which corresponds to the Chomsky hierarchy. 

This paper is devoted to the description of limited automata and one restricted ver¬ 
sion of them. In Section 2 we introduce the model, presenting some examples and some 
properties. In particular, we point out that a deterministic family of limited automata 
characterizes the class of deterministic context-free languages. We also discuss descrip- 
tional complexity results, comparing the size of the description of context-free languages 
by pushdown automata, with the size of the description by limited automata. 

Since we are interested in devices working with restricted resources or restricted op¬ 
erations, in Section 3 we will consider a machine model with a set of possible operations 
which further restricts the operations available on limited automata. This models is called 
strongly limited automata [21]. The idea of studying it was inspired by the fact that Dyck 
languages, namely languages of well balanced sequences of brackets, are recognized by 
some kinds of limited automata that use the capabilities of these devices in a restricted 
way. Furthermore, according to the Chomsky-Schiitzenberger representation theorem for 
context-free languages [2], the “context-free part” of each context-free language is repre¬ 
sented by a Dyck language. Indeed, even if they have severe restrictions, strongly limited 
automata still characterize the class of context-free languages. However, their determinis¬ 
tic version is weaker than deterministic pushdown automata, namely it cannot recognize 
all deterministic context-free languages. Even for these devices we discuss descriptional 
complexity aspects. 

In the final section, we briefly mention further models restricting one-tape machines 
and related to context-free language recognition. 

2 Limited Automata 

Given an integer d > 0, a d-limited automaton is a tuple A = ( Q , £, T, 5, qo, F ), where: 

• Q is a finite set of states. 

• £ and F are two finite sets of symbols, called respectively the input alphabet and 
the working alphabet, such that £ U {>,<]} C F, where >, <1 ^ £ are two special 
symbols, called the left and the right end-markers. 

• 5 : Q X T —> 2^ x ( r \t t> ’ <| h) x t—i s the transition function. 
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• < 7 o £ Q is the initial state. 

• F C Q is the set of final states. 

At the beginning of the computation, the input is stored onto the tape surrounded by the 
two end-markers, the left end-marker being at position zero. Hence, on input w, the right 
end-marker is on the cell in position \w\ + 1. The head of the automaton is on cell 1 and 
the state of the finite control is go. In one move, according to 5 and to the current state, 
A reads a symbol from the tape, changes its state, replaces the symbol just read from 
the tape by a new symbol, and moves its head to one position forward or backward. In 
particular, ( q,X,m ) G S(p,a ) means that when the automaton in the state p is scanning 
a cell containing the symbol a, it can enter the state q, rewrite the cell content by X, 
and move the head to left, if rn = —1, or to right, if m = +1. Furthermore, the head 
cannot violate the end-markers, except at the end of computation, to accept the input, 
as explained below. However, replacing symbols is subject to some restrictions, which, 
essentially, allow the modification of the content of a cell only during the first d visits. 
To this aim, the alphabet T is partitioned into d + 1 sets To,Ti,... ,Td, where To = £ 
and [>, <1 G T^. With the exception of the cells containing the end-markers, which are 
never modified, at the beginning all the cells contain symbols from To = £. In the k- th 
visit to a tape cell, the content of the cell is rewritten by a symbol from r^., up to k = d, 
when the content of the cell is “frozen”, i.e., after that, the symbol in the cell cannot be 
changed further. Actually, on a cell we do not count the visits, but the scans from left 
to right (corresponding to odd numbered visits) and from right to left (corresponding to 
even numbered visits). Hence, a move reversing the head direction is counted as a double 
visit for the cell where it occurs. In this way, when a cell c is visited for the kth time, 
with k < d, its content is a symbol from T^-i. If the move does not reverse the head 
direction, then the content of the cell is replaced by a symbol from THowever, if the 
head direction is reversed, then in this double visit the symbol is replaced by a symbol 
from T fc+ i, when k < d, and by a symbol of T^ that after then is frozen, otherwise. 

Formally, for each G S(p,a), with p, q G Q, a G Tfc, 7 G T^, m G { —1,+1}, we 

require the following: 

• if k = d then a = 7 and k = h, 

• if k < d and m = +1 then h = min( |~|~| -2 + 1 ,d), 

• if k < d and m = — 1 then h = min( |"^± 1 ] • 2 , d). 

An automaton A is said to be limited if it is d-limited for some d > 0 . A accepts an 
input w if and only if there is a computation path which starts from the initial state go 
with the input tape containing w surrounded by the two end-markers and the head on the 
first input cell, and which ends in a final state q G F after violating the right end-marker. 
The language accepted by A is denoted by L(A). A is said to be deterministic whenever 
ffS(q, a) < 1, for any q G Q and hGT. 

Example 1 For each integer k > 1, we denote by the alphabet of k types of brackets, 
which will be represented as {( l5 ) 1; ( 2 , ) 2 ? •• ■) (ki )fc}- The Dyck language D & over the 
alphabet is the set of strings representing well balanced sequences of brackets. We will 
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Figure 1: Some steps in an accepting computation of the automaton M of Algorithm 1 
on input ()(([]))(). 

refer to the ( i symbols as “open brackets” and the ) i symbols as “closed brackets”, i.e. 
opening and closing brackets. 

The Dyck language D *. can be recognized by a machine M which starts having the 
input string on its tape, surrounded by two end-markers D> and < 1 , with the head on 
the first input symbol. From this configuration, M moves to the right to find a closed 
bracket ) i , 1 < i < k. Then M replaces ) i with a symbol X ^ fl*. and changes the head 
direction, moving to the left. In a similar way, it stops when during this scan it meets for 
the first time a left bracket (j. If i ^ j, i.e., the two brackets are not of the same type, 
then M rejects. Otherwise, M writes X on the cell and changes again the head direction 
moving to the right. This procedure is repeated until M reaches one of the end-markers. 
(See Figure 1.) 

• If the left end-marker is reached, then it means that at least one of the right brackets 
in the input does not have a matching left bracket. Hence, M rejects. 

• If instead the right end-marker is reached, then M has to make sure that every left 
bracket has a matching right one. In order to do this, it scans the entire tape from 
the right to the left and if it finds a left bracket not marked with X then M rejects. 
On the other hand, if M reaches the left end-marker reading only Xs, then it can 
accept the input. 

Now we look more into the details of the implementation of this procedure, which is 
summarized in Algorithm 1. 

The machine Ajj starts the computation in the initial state qo (line 1). While moving 
to the right to search for a closed bracket (line 3), Ap does not need to keep in its finite 
control any other information, so it can always use the same state qo. On the other hand, 
after a closed bracket ) i is found, 1 < i < k, Ap needs to remember the index i in order 
to find a matching open bracket (j. This is done by using a state qi for moving to the left 
while performing the search and the next rewriting (lines 6-10). The final loop and test 
(lines 11-15) can be performed using a further state q < ,. Notice that each cell containing 
a closed bracket is rewritten in the first visit, while changing the head direction, and each 
cell containing an open bracket is rewritten in the second visit. Furthermore, the content 
of a cell is not rewritten after the second visit. Hence the machine Ap we just described 
is a deterministic 2 -limited automaton . 6 □ 

6 According to the definition of limited automaton, the alphabet T should be partitioned in three sets 
To = E, Ti, and T 2 , and each open bracket should be rewritten by a symbol of Ih in the first visit. This can 
be trivially done by replacing the open bracket with a marked version. However, for the sake of simplicity, 
here and in the next examples we prefer to avoid these details. 
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Algorithm 1: Recognition of the Dyck language D k 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 


start with the head on the first input symbol 
while symbol under the head ^ <1 do 
move the head to the right 

if symbol under the head = ) i (with 1 < i < k) then 
write X 

repeat 

j move the head to the left 
until symbol under the head ^ X 
if symbol under the head ^ ( i then Reject 
write X 


11 repeat 

12 | move the head to the left 

13 until symbol under the head ^ X 

14 if symbol under the head = t> then Accept 
is else Reject 


Example 2 For each integer n, let us denote by K n the set of all strings over the alpha¬ 
bet {0,1} consisting of the concatenation of blocks of length n, such that at least n blocks 
are equal to the last one. Formally: 

K n = {xix 2 ---x k x | k > 0, xi,x 2 ,...,Xk,x G {0, l} n , 

3*1 < *2 < ■ ■ ■ < i n G {1, • • •, k}, x ix = x i2 = ... = x in = x} . 

We now describe a 2-limited automaton M accepting K n . Suppose M receives an input 
string w of length N. 

1. First, M scans the input tape from left to right, to reach the right end-marker. 

2. M moves its head n + 1 positions to the left, namely to the cell i = N — n, the one 
immediately to the left of the input suffix x of length n. 

3. Starting from this position i, M counts how many blocks of length n coincide with x. 
This is done as follows. 

When M, arriving from the right, visits a position i < N — n for the first time, it 
replaces the content a by a special symbol X , after copying a in the finite control. 
Hence, M starts to move to the right, in order to compare the symbol removed from 
the cell with the corresponding symbol in the block x. While moving to the right, 
M counts modulo n and stops when the counter is 0 and a cell containing a symbol 
other than X is reached.' The symbol of x in this cell has to be compared with a. 
Then, M moves to the left until it reaches cell i — 1, namely the first cell which does 
not contain X, immediately to the left of cells containing X. 

We observe that the end of a block is reached each time a symbol a copied from 
the tape is compared with the leftmost symbol of x, which lies immediately to the 
right of a cell containing X. If in the block just inspected no mismatches have been 

' We remind the reader that M has to recognize the language K n for a fixed integer n. 
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discovered then the counter of blocks matching with x is incremented (unless its 
value was already n). 

4. When the left end-marker is reached, M accepts if and only if the input length is a 
multiple of n and the counter of blocks matching with x contains n. 

We can easily observe that the above strategy can modify tape cells only in the first two 
visits. Hence, it can be implemented by a deterministic 2-limited automaton. Such an 
automaton uses 0 (n 2 ) states and a constant size alphabet. 

Actually, using nondeterminism, it is possible to recognize the language K n using 0(n ) 
states and modifying tape cells only in the first visit, namely K n is accepted by a nonde- 
terministic 1-limited automaton M, which we now describe . 8 M works in three phases. 

1. First, M scans its tape from left to right. During this phase, M marks exactly n + 1 
input cells. The first n marked cells are guessed to be the leftmost positions of the 
n input blocks xq,..., Xi n which are expected to be equal to the rightmost block. 
The last marked cell is guessed to be the leftmost position of the rightmost block. 
This phase can be implemented using n + 2 states, to count how many positions 
have been marked. 

2. When the right end-marker is reached, M makes a complete scan of the input from 
right to left, in order to verify whether or not the input length is a multiple of n, the 
last cell that has been marked in the first phase is the leftmost cell of the last block, 
and the other marked cells are the leftmost cells of some blocks. If the outcome of 
this phase is negative, then M stops and rejects. The number of states used here 
is 2 n. 

3. Finally, M verifies that all the blocks starting from the marked positions contain the 
same string of length n. To this aim, the block from position jh-i is compared symbol 
by symbol with the block from position j/,_, for h = 1 ,..., n, where jo < ji <■■■ < j n 
are the marked positions. To make these comparisons, M with the head on the ?'th 
symbol of the block h — 1 moves the head to the ith symbol of the block h. To detect 
the position of such symbol, M while moving the head to the right counts modulo n, 
until, after visiting a marked cell (namely cell jh ), the value of the counter becomes 0. 
If the comparison fails, then M can reject, otherwise M starts to move to the left, 
still counting modulo n, and searching the leftmost cell of the block h — 1. Then M 
can start to move to the right, while decrementing the counter, reaching cell i when 
the counter is 0 and so locating cell i +1 to start the next comparison. Furthermore, 
the value of the counter also allows the discovery of whether all n symbols of the 
block have been inspected. The implementation of this phase uses 0{n) states. □ 

Computational power 

The following theorem summarizes the most important known results on the computa¬ 
tional power of d-limited automata: 

s The presentation is adapted from a similar example presented in [22], where more details can be found. 




Figure 2: A machine accepting L = h(D k n R ) 


Theorem 3 (i) For each d > 0, the class of languages accepted by d-limited automata 

coincides with the class of context-free languages [8j. 

(ii) The class of languages accepted by deterministic 2-limited automata coincides with 
the class of deterministic context-free languages [23]. 

(Hi) For each d > 2, there exists a language which is accepted by a deterministic d- 
limited automaton, which cannot be accepted by any deterministic (d — 1) -limited 
automaton [8]. 

(iv) The class of languages accepted by 1-limited automata coincides with the class of 
regular languages [29]. 

The argument used by Hibbard to prove Theorem 3(i) is very difficult. He provided 
some constructions to transform a kind of rewriting system, equivalent to pushdown au¬ 
tomata, to 2 -limited automata and vice versa, together with reductions from (ci + 1)- 
limited automata to d-limited automata, for d> 2 [ 8 ]. 

We now discuss a construction of 2-limited automata from context-free languages (al¬ 
ready presented in [22]), which is based on the Chomsky-Schiitzenberger representation 
theorem for context-free languages [2]. We remind the reader that this theorem states 
that each context-free language can be obtained by selecting in a Dyck language D k , 
with k kinds of brackets, only the strings belonging to a regular language R, and then 
renaming the symbols in the remaining strings according to a homomorphism L. More 
precisely, every context-free language LCE* can be expressed as L = h(D k n R), where 
D k C k > 1, is a Dyck language, R C is a regular language, and h : f —> £* is a 
homomorphism. 

Hence, given LCE* context-free, we can consider the following machines: 

• A nondeterministic transducer T computing h -1 . 

• The 2-limited automaton Ap described in Example 1 recognizing the Dyck language 

D k - 

• A finite automaton A }j accepting the regular language R. 

To decide if a string w S E*, we can combine these machines as in Figure 2. 

Now, we discuss how to embed the transducer T, the 2-limited automaton Ap, and the 
automaton An in a unique 2-limited automaton M, provided that the homomorphism h is 
non-erasing , namely h(a) e, for each a E 
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In a first phase, T and Ap work together using a producer-consumer scheme and, after 
that, in a second phase Ar is simulated. In the first phase, when Ap has to examine for 
the first time a tape cell, T produces in a nondeterministic way a symbol o E h~ l (u), 
for a nondeterministically chosen prefix u of the part of the input w which starts from 
X”s current head position. Then, a move of Ap which rewrites o by a new symbol o' is 
simulated. The symbol o' needs to be stored somewhere in the case Ap has to visit the 
same tape cell during the computation. Furthermore, the symbol o will be used for the 
simulation of Ar. Hence, the machine M has to keep both symbols o and o'. Since h 
is non-erasing, this can be done by replacing u by the pair (<r, o') on the tape. More 
precisely, the tape of 2-limited automaton M is divided in two tracks. At the beginning 
of the computation, the first track contains the input w, while the second track is empty. 
In the first phase, the finite control of M simulates the controls of both T and Ap. 
M alternates the simulation of some computation steps of T with the simulation of some 
computation steps of Ap as follows: 

1. When the head reaches a cell which has not yet been visited (hence, also at the 
beginning of the computation), M simulates T, by nondeterministically replacing a 
prefix u of the remaining input, with a string jjl“l -1 (T such that o E /i _1 (u). The 
symbol ft is used for padding purposes. All the cells containing this symbol will be 
skipped in the future steps. 

2. In the last step of the above-described part of computation, when the rightmost 
symbol of u is replaced by a on the first track, M also resumes the simulation 
of Ap, starting from a step reading o. Hence, while writing o on the first track, 
M also writes on the second track the symbol o' which is produced by Ap, while 
rewriting a in the first visit. 

3. If Ap moves to the left, going back to already-visited cells, then M simulates directly 
the moves of Ap, skipping all cells containing ft, and using the second track of the 
tape. When, moving to the right, the head of M reaches a cell which has not been 
visited before, the simulation of Ap is interrupted. (A cell not visited before can 
be located since it does not contain jj and the second track is empty.) In this case, 
M resumes the simulation of T, as explained at point 1, except in the case the cell 
contains the right end-marker. 

4. When the right end-marker is reached, the first track contains a string z E /i _1 (u>), 

while the second track contains the result of the rewriting of 2 by Ap (ignoring all 
the cells containing fl). If Ap rejects, namely, z ^ Dk , then M rejects. Otherwise, 
M moves its head to the left end-marker and, starting from the first tape cell, it 
simulates the automaton Ar , consulting the first track, in order to decide whether 
or not Finally, M accepts if and only if Ar accepts. 

Actually, the simulation of the automaton Ar can be done in the first phase, while simu¬ 
lating T and Ap. In particular, at the previous point 2, when T produces a symbol o and 
M simulates a move of Ap on o, M can also simulate a move of Ar on o. To this aim, 
M has to keep in its finite state control, together with the controls of T and Ap, also the 
control of Ar. This increases the number of the states of M , but makes superfluous having 
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the first track to keep the string z. Hence, it reduces the size of the working alphabet 
of M. 

In this construction, we used the hypothesis that the homomorphism h is non-erasing. 
Actually, the classical proof of the Chomsky-Schiitzenberger representation theorem pro¬ 
duces an erasing homomorphism. However, an interesting variant of the theorem, recently 
obtained by Okhotin, states that for L C E* \ S we can always restrict ourselves to 
non-erasing homomorphisms, namely, we can consider h : fl*. —> X + [18] . 9 

We point out that the construction of 2-limited automata we just outlined always pro¬ 
duces a nondeterministic machine, since the transducer T is intrinsically nondeterministic. 

Actually, the transformation from pushdown automata to 2-limited automaton pro¬ 
vided by Hibbard preserves the determinism. Hence, each deterministic context-free lan¬ 
guage is accepted by a deterministic 2-limited automaton. The converse inclusion, which 
was left open by Hibbard, has been recently proved, thus obtaining that the class of 
languages accepted by deterministic 2-limited automata coincides with the class of deter¬ 
ministic context-free languages (Theorem 3(ii)) [23]. 

It is natural to ask what is the power of determinism in the case of d-limited automata, 
with d > 3. It is not difficult to describe a deterministic 3-limited automaton accepting 
the nondeterministic context-free language { a n b n c \ n > 0} U {a n b 2n d \ n > 0}. Indeed, 
Hibbard showed the existence of an infinite deterministic hierarchy (Theorem 3(iii)). 10 

The simulation of 1-limited automata by finite automata (Theorem 3(iv)) has been 
obtained in [29, Thm. 12.1] by adapting standard techniques involving transition tables. 
These techniques have been introduced for converting two-way automata into equivalent 
one-way automata [24]. 

Descriptional power 

Descriptional complexity aspects related to the results in Theorem 3 have been recently 
investigated providing new conversions between 2-limited automata and pushdown au¬ 
tomata and between 1-limited automata and finite automata. 

Theorem 4 ([23]) 

(i) Each n-state 2-limited automaton can be simulated by a pushdown automaton of size 
exponential in a polynomial in n. 

(ii) The previous upper bound becomes a double exponential when a deterministic 2- 
limited automaton is simulated by a deterministic pushdown automaton, however it 
remains a single exponential if the input of the deterministic pushdown automaton 
is given with a symbol to mark the right end. 

(in) Each pushdown automaton M can be simulated by a 2-limited automaton whose size 
is polynomial with respect to the size of M. 

!, The restriction tCE’\S can be easily removed after observing that one-letter strings can be trivially 
recognized without any rewriting. 

1(, Hibbard proved the separations in Theorem 3(iii) for d > 3. In the case d = 2 the separation follows 
from the fact that the 1-limited automata accept only regular languages. 
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(iv) The previous upper bound remains polynomial when a deterministic pushdown au¬ 
tomaton is simulated by a deterministic 2-limited automaton. 

The exponential gap for the conversion of 2-limited automata into equivalent pushdown 
automata cannot be reduced. In fact, the language K n presented in Example 2 is ac¬ 
cepted by a (deterministic) 2 -limited automaton with 0(n 2 ) states and a constant size 
alphabet, while the size of each pushdown automaton accepting it must be at least expo¬ 
nential in n [23]. This also implies that the simulation of deterministic 2-limited automata 
by deterministic pushdown automata is exponential in size. Actually, we conjecture that 
this simulation costs a double exponential, namely it matches the upper bound in Theo¬ 
rem 4(ii). 

Theorem 5 ([22]) Each n-state 1-limited automaton M can be simulated by a nondeter- 

2 n 2 

ministic automaton with n ■ 2 n states and by a deterministic automaton with 2 n ' 2 states. 
Furthermore, if M is deterministic then an equivalent Idfa with no more than n • (re + l) n 
states can be obtained. 

The doubly exponential upper bound for the conversion of nondeterministic 1-limited 
automata into deterministic automata is related to a double role of nondeterminism in 1 - 
limited automata. When a 1-limited automaton visits one cell after the first rewriting, the 
possible nondeterministic transitions depend on the symbol that has been written in the 
cell in the first visit, which, in turns, depends on the nondeterministic choice taken in the 
first visit. This double exponential cannot be avoided. In fact, as we already observed, 
the language K n of Example 2 is accepted by a nondeterministic 1-limited automaton 
with 0(n ) states and, using standard distinguishability arguments, it can be shown that 
each deterministic automaton accepting it requires a number of states doubly exponential 
in n. As observed in [ 22 ], even the simulation of deterministic 1-limited automata by 
two-way nondeterministic automata is exponential in size. 

It should be interesting to investigate the size costs of the simulations of d-limited 
automata by pushdown automata for d > 0. 

We conclude this section by briefly mentioning the case of unary languages. It is well- 
known that in this case each context-free language is regular [ ]. Hence, for each d > 0, 
d-limited automata with a one letter input alphabet recognize only regular languages. 
In [22], a result comparing the size of unary 1-limited automata with the size of equivalent 
two-way nondeterministic finite automata has been obtained. Quite recently, Kutrib and 
Wendlandt proved state lower bounds for the simulation of unary d-limited automata by 
different variants of finite automata [ 12 ]. 

3 Strongly Limited Automata 

In Section 2, using the Chomsky-Schiitzenberger representation theorem for context-free 
languages, we discussed how to construct a 2 -limited automaton accepting a given context- 
free language. The main component of such an automaton is a 2-limited automaton ac¬ 
cepting a Dyck language D However, Algorithm 1 described in Example 1 recognizes Dk 
without fully using the capabilities of 2-limited automata. For instance, it does not need 
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to rewrite each tape cell two times. So, we can ask if it is further possible to restrict the 
moves of 2-limited automata, still keeping the same computational power. 

In [21] we gave a positive answer to this question, by introducing strongly limited 
automata, a restriction of limited automata which closely imitates the moves which are 
used in Algorithm 1. In particular, these machines satisfy the following restrictions: 

• While moving to the right, a strongly limited automaton always uses the same 
state qo until the content of a cell (which has not been yet rewritten) is modified. 
Then it starts to move to the left. 

• While moving to the left, the automaton rewrites each cell it meets that is not yet 
rewritten up to some position where it starts again to move to the right. Further¬ 
more, while moving to the left the automaton does not change its internal state. 

• In the final phase of the computation, the automaton inspects all tape cells, to 
check whether or not the final content belongs to a given 2-strictly locally testable 
language. Roughly, this means that all the factors of two letters of the string which 
is finally written on the tape 11 should belong to a given set. 

We now present the detailed definition of this model and then we discuss it. 

Definition 1 A strongly limited automaton is a tuple M. = (Q, £, T, 6, qo, (/>), where: 

• Q is a finite set of states, which is partitioned in the three disjoint sets {go}; Ql, 
and Qr- 

• £ and T are two finite and disjoint sets of symbols, called respectively the input 
alphabet and the working alphabet of Ai. Let us denote by T the global alphabet 
of M defined as T = £ U T U {>, <]}, where >,<] ^ £UT are, respectively, the left 
and the right end-marker. 

• 5 : Q x Y —> 2^““' 1 1 x e T,q e Q} ^ ^/ ie transition function, which 

associates a set of possible operations with each configuration of M.. 

• qo is the initial state. 

• g> G Qr is the final state. 

The transition function 5 has to satisfy the conditions listed below. 

• For the state qo: 

- 5{qo,a ) = {---►} if a G T, 

— 5(qo,a) C {--->} U q G Ql,X G T} i/ a G £, 

- <J(g 0 ,<) = W— 1 }; 

— 5(qo,>) is undefined. 

• For each state q G Ql: 

11 The string which is considered includes the end-markers. 
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- S(q,a) = {<---} if a G F, 

- 5(q,a ) c ^+ qo \ X G T} if a G S, 

— S(q,a) is undefined if a & {>,<]}. 

• For each state q x G Qr •' 

- S(qx,a) Q {ga^- 1 }^ where a G T. 

We now describe how A4 works, providing an informal explanation of the meaning of 
the states and of the operations that A4 can perform. First of all, we assume that at the 
beginning of the computation the tape contains the input string w G £* surrounded by the 
two end-markers. Tape cells are counted from 0. Hence, cell 0 contains > and cell |w| + 1 
contains <3. The head is on cell 1, namely scanning the leftmost symbol of w, while the 
finite control is in <70 - 

The initial state qo is the only state which is used while moving from left to right. 
In this state all the cells that have been already rewritten are ignored, just moving one 
position further, while on all the other cells A4 could be allowed either to move to the 
right or to rewrite the cell content and then turn the head direction to the left, entering a 
state in the set Ql■ To this aim, in the state qo the following operations could be possible: 

• Move to the right ---> 

Move the head one position to the right without rewriting the cell content and without 
changing the state. 

• Turn to the left 

Write X G T in the currently scanned tape cell, move one position to the left, entering 
in state q G Ql- After a sequence of moves from left to right, with this operation 
A4 rewrites the content of the current cell and changes the head direction, entering 
a state q G Ql- 

We point out that these two operations are not allowed in states other than qo- One 
further operation ( 9 < -<— 1 , described later) is possible in qo, when the right end-marker is 
reached, to activate the final phase of the computation. 

The states in the set Ql are used to move to the left. In a state q G Ql, the au¬ 
tomaton A4 ignores all the cells that have been already rewritten, just moving to the left. 
On the remaining cells that A4 visits, it always rewrites the content up to some position 
where it turns its head to the right. During this procedure, M changes state only at the 
end, when it enters again in qo. In a state q G Ql the following operations can be allowed: 

• Move to the left <--- 

Move the head one position to the left without rewriting the cell content and without 
changing the state. This move is used only on cells that have been rewritten. 

• Write and move to the left 

Write X G T in the currently scanned tape cell, move one position to the left, without 
changing the state. This move can be used only on cells not yet rewritten. 

• Turn to the right 

Write X G T in the currently scanned tape cell, move one position to the right, 
entering in the state <70 • Even this move can be used only on cells not yet rewritten. 
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If the left end-marker is reached while scanning to the left in a state of Ql then the 
computation stops by rejecting (technically the next transition is undefined). On the 
other hand, if the right end-marker is reached while scanning to the right in qo, the 
machine starts a final phase where it completely scans the tape from right to left and then 
stops. During the last phase M checks the membership of the final tape content to a 
local language. 12 If some forbidden factor is detected then the next transition is undefined 
and hence M rejects. To this aim, in this phase only states from the set Q y are used. 
We assume that there is a surjective map from T to Qr • We simply denote as q x the 
state associated with the symbol X £ T. Note that X / Y does not implies q x / q Y . The 
following operation is used in this phase: 

• Check to the left —i 

On a cell containing symbol a E T, move to the left remembering the state associated 
with a. 

If no forbidden factor is found, Ai finally violates the left end-marker in the state q >. In 
this case the input is accepted. Otherwise the computation of Ad stopped in some previous 
step, rejecting the input. Hence, we assume that At accepts its input if and only if from 
the cell containing the left end-marker it can further move to the left entering the final 
state . 

Example 6 Consider the alphabet D 2 , with brackets represented by the symbols (,),[,]. 
The Dyck language D 2 is accepted by a strongly limited automaton with T = {x}, 
Ql = {< 7 i, <? 2 }•, Qr = {< 7 x 5 < 7 t>, <?<}, and the following transitions (we omit braces for the 
sake of the brevity): 

• %o, () = %o, [) =—H S(qo, )) =qi+ JL , %0, ]) = 92+^, 

• S(qi,X) = 5(q 2 ,X) =<■ — , 5(q 1 , () = 8(q 2 , [) = ^g Q , 

• %0,<) = <?<<—', %<u x ) =<%x,X) = q ^— h, 8(q x ,>) = g> -<— 1 . 

It can be observed that the states <?<, fto </>> used in the final scan, can be merged in a 
unique state q^>. In fact, in this example the purpose of the final scan is to check that all 
the input symbols have been rewritten, namely, no symbol a G {(,),[,]} is left on the 
tape. If such a symbol is discovered, then the next transition is not defined and hence the 
computation rejects. □ 

Example 7 The deterministic context-free language {a n b 2n \ n > 0} is accepted by a 
strongly limited automaton which guesses each second b. While moving from left to right 
and reading b, the automaton makes a nondeterministic choice between further moving to 
the right or rewriting the cell by X and turning to the left. Furthermore, while moving 
to the left, the content of each cell containing b which is visited is rewritten by Y, still 

12 A regular language L is said to be strictly locally testable if there is an integer k such that membership 
of a string x to L can be “locally” verified by inspecting all factors of length k in x [ ]. In the case k = 2 

we simply say that the language is local. More precisely, given an alphabet A and two extra symbols 
>, < (f A, we say that a language L C A* is local if and only if there exists a set F C (Au {>, <}) 2 of 
forbidden factors such that a string x £ A* belongs to L if and only if no factor of length 2 of >®< belongs 
to F. 
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moving to the left, and when a cell containing a is visited, its content is replaced by Z, 
turning to the right. In the final scan the machine accepts if and only if the string on the 
tape belongs to >Z*(YX)*<I. 

We can modify the above algorithm to recognize the language {a n b n \ n > 0}U {a n b 2n \ 
n > 0}. While moving from left to right, when the head reaches a cell containing b three 
actions are possible: either the automaton continues to move to the right, without any 
rewriting, or it rewrites the cell by Z, turning to the right, or it rewrites the cell by W, also 
turning to the right. While moving from right to left, the automaton behaves as the one 
above described for {a n b 2n \ n > 0}. The input is accepted if and only if the string which 
is finally on the tape belongs to >Z*W*<I + >Z*(YX)*<. □ 

We already mentioned that strongly limited automata have the same computational 
power as limited automata, namely they characterize context-free languages. This result 
has been proved in [ 21 ], also studying descriptional complexity aspects: 

Theorem 8 ([21]) 

(i) Each context-free language L is accepted by a strongly limited automaton whose de¬ 
scription has a size which is polynomial with respect to the size of a given context-free 
grammar generating L or of a given pushdown automaton accepting L. 

(ii) Each strongly limited automaton M. can be simulated by a pushdown automaton of 
size polynomial with respect the size of M.. 

The proof of (i) was obtained using a further variant of the Chomsky-Schiitzenberger 
representation theorem, also proved by Okhotin [18]. In this variant Dyck languages 
extended with neutral symbols and letter-to-letter homomorphisms are used, (ii) has 
been proved by providing a direct simulation. 

Concerning deterministic computations, it is not difficult to observe that deterministic 
strongly limited automata cannot recognize all deterministic context-free languages. Con¬ 
sider, for instance, the deterministic language L = {ca n b n \ n > 0} U {da 2n b n \ n > 0}. 
While moving from left to right, a strongly limited automaton can use only the state qo- 
Hence, it cannot remember if the first symbol of the input is a c or a d and, then, if it has 
to check whether the number of as is equal to the number of bs or whether the number 
of as is two times the number of 6 s. A formal proof that the language L, and also the 
language {a n 6 2n | n > 0} (Example 7), are not accepted by any deterministic strongly 
limited automaton is presented in [ 21 ]. 

In that paper it was also proposed to slightly relax the definition of strongly limited 
automata, by introducing a set of states Qr, with qo 6 Qr , used while moving to the 
right, and allowing transitions between states of Qr and of Qr while moving to the left 
and to the right, respectively, but still forbidding state changes on rewritten cells and by 
keeping all the other restrictions. This model, called almost strongly limited automata, 
still characterizes the context-free languages. Furthermore, the two deterministic context- 
free languages mentioned in the previous paragraph can be easily recognized by almost 
strongly limited automata having only deterministic transitions. 

It would be interesting to know if almost strongly limited automata are able to accept 
all deterministic context-free languages without taking nondeterministic decisions. 
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4 Conclusion 


We discussed some restricted versions of one-tape Turing machines characterizing context- 
free languages. Some other interesting models are presented in the literature. We briefly 
mention some of them. 

In 1996 Jancar, Mraz, and Platek introduced forgetting automata [10]. These devices 
can erase tape cells by rewriting their contents with a special symbol. However, rewritten 
cells are kept on the tape and are still considered during the computation. For instance, 
the state can be changed while visiting an erased cell. In a variant of forgetting automata 
that characterizes context-free languages, when a cell which contains an input symbol is 
visited while moving to the left, its content is rewritten, while no changes can be done 
while moving to the right. This way of operating is very close to that of strongly limited 
automata. However, in strongly limited automata the rewriting alphabet can contain 
more than one symbol. Furthermore, rewritten cells are completely ignored (namely, the 
head direction and the state cannot be changed while visiting them) except in the final 
scan of the tape from the right to the left end-marker. So the two models are different. 
For example, to recognize the set of palindromes, a strongly limited automaton needs a 
working alphabet of at least 3 symbols while, by definition, to rewrite tape cells forgetting 
automata use only one symbol [21]. 

If erased cells are removed from the tape of a forgetting automaton, we obtain another 
computational model called deleting automata. This model is less powerful. In fact it is 
not able to recognize all context-free languages [10]. 

Wechsung proposed another complexity measure for one-tape Turing machines called re¬ 
turn complexity [30, 31]. This measure counts the maximum number of visits to a tape 
cell, starting from the first visit which modifies the cell content. It should be clear that 
return complexity 1 characterizes regular languages (each cell, after the first rewriting, 
will be never visited again, hence the rewriting is useless). Furthermore, for each d > 2, 
return complexity d characterizes context-free languages. Notice that this measure is dual 
with respect to the one considered to define limited automata. 13 Even with respect to 
return complexity, there exists a hierarchy of deterministic languages (cf. Theorem 3(iii) 
in the case of limited automata). However, this hierarchy is not comparable with the 
class of deterministic context-free languages. For instance, it can be easily seen that the 
set of palindromes, which is not a deterministic context-free language, can be recognized 
by a deterministic machine with return complexity 2. However, there are deterministic 
context-free languages that cannot be recognized by any deterministic machine with return 
complexity d, for any integer d [19]. 

With the aim of investigating computations with very restricted resources, Hemas- 
paandra, Mukherji, and Tantau studied one-tape Turing machine with absolutely no space 
overhead , a model which is very close to “realistic” computations, where the space is mea¬ 
sured without any hidden constants [6]. These machines use the binary alphabet £ = {0,1} 
(plus two end-marker symbols) and only the portion of the tape which at the beginning of 
the computation contains the input. Furthermore, no other symbols are available, namely 
only symbols from £ can be used to rewrite the tape. Despite these strong restrictions, 

13 Indeed, the maximum number of visits to a cell up to the last rewriting, namely the measure used to 
define limited automata, is sometimes called dual return complexity [29]. 
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there machines are able to recognize in polynomial time all context-free languages over X. 
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